Predicting Author Blog Channels with High Value Future Posts for Monitoring

نویسندگان

  • Shanchan Wu
  • Tamer Elsayed
  • William Rand
  • Louiqa Raschid
چکیده

The phenomenal growth of social media, both in scale and importance, has created a unique opportunity to track information diffusion and the spread of influence, but can also make efficient tracking difficult. Given data streams representing blog posts on multiple blog channels and a focal query post on some topic of interest, our objective is to predict which of those channels are most likely to contain a future post that is relevant, or similar, to the focal query post. We denote this task as the future author prediction problem (FAPP). This problem has applications in information diffusion for brand monitoring and blog channel personalization and recommendation. We develop prediction methods inspired by (naı̈ve) information retrieval approaches that use historical posts in the blog channel for prediction. We also train a ranking support vector machine (SVM) to solve the problem. We evaluate our methods on an extensive social media dataset; despite the difficulty of the task, all methods perform reasonably well. Results show that ranking SVM prediction can exploit blog channel and diffusion characteristics to improve prediction accuracy. Moreover, it is surprisingly good for prediction in emerging topics and identifying inconsistent authors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction in Social Media for Monitoring and Recommendation

Title of dissertation: PREDICTION IN SOCIAL MEDIA FOR MONITORING AND RECOMMENDATION Shanchan Wu, Doctor of Philosophy, 2012 Dissertation directed by: Professor Louiqa Raschid Department of Computer Science Social media including blogs and microblogs provide a rich window into user online activity. Monitoring social media datasets can be expensive due to the scale and inherent noise in such data...

متن کامل

Predicting gender from blog posts

Blogs are informal, personal writings that people post on their own blog sites. Nowadays, blogging is an important online activity. People share blogs with their friends and family members. The topics of blog posting cover almost everything, ranging from personal life, political opinions, recipes, product reviews, or even just random rants. Although some bloggers review their biologically infor...

متن کامل

Estimating Number of Citations Using Author Reputation

We study the problem of predicting the popularity of items in a dynamic environment in which authors post continuously new items and provide feedback on existing items. This problem can be applied to predict popularity of blog posts, rank photographs in a photo-sharing system, or predict the citations of a scientific article using author information and monitoring the items of interest for a sh...

متن کامل

Future Link Prediction in the Blogosphere for Recommendation

The phenomenal growth in both scale and importance of social media such as blogs, micro-blogs and user-generated content, has created a need for tools that monitor information diffusion and make recommendations within these platforms. An essential element of social media, particularly blogs, is the hyperlink graph that connects various pieces of content. There are two types of links within the ...

متن کامل

Socio-temporal analysis of conversations in intra-organizational blogs

Blogs have been popular on the Internet for a number of years and are becoming increasingly popular within organizations as well. The analysis of blog posts is a useful way to understand the nature of expertise within the firm. In this paper we are interested in understanding the topics of conversations that evolve through blog posts and replies. While keywords within blog posts can be used to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011